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Abstract 

We show that quantum algorithms can be used to re-prove a classical theorem in approximation 
theory, Jackson's Theorem, which gives a nearly-optimal quantitative version of Weierstrass's Theorem on 
uniform approximation of continuous functions by polynomials. We provide two proofs, based respectively 
on quantum counting and on quantum phase estimation. 

1 Introduction 

In many mathematical contexts it is convenient to approximate complicated objects by simpler ones. A typ- 
ical example is the approximation of arbitrary continuous functions on a closed interval by polynomials 14 . 
Weierstrass's Theorem [16] states that this can always be done. More precisely, the real polynomials arc 
"dense" in the space C[0, 1] of continuous real functions on the unit interval: for every g &C[0, 1] and e > 0, 
there exists a polynomial p(x) such that \p(x) — g(x)\ < e for all x £ [0, 1]. 

Bernstein [3] gave a simple and elegant probabilistic construction of such approximating polynomials, 
which can be described as follows (see also [U after Chapter 7]). Given a function g G C[0, 1] that we want 
to approximate, fix an n > 1 and flip n independent coins, each coming up T' with probability x. Count 
the Hamming weight \w\ of the resulting string w € {0, 1}™, and output g(\w\/n). Note that the expected 
value of \w\/n is exactly x, and with high probability we'll have \w\/n — x ± 0(1/ 'y/n). But then the output 
g(\w\/n) should usually be a good estimate of g(x). Indeed, consider the expected value of the output of this 
algorithm, as a function of x: 

n / n \ 

B g , n (x) := E w [g(\w\/n)} = £ %L x k (l - x) n - k g{k/n). 



k=0 



This B g n is a polynomial in x of degree n. Since \w\/n is probably close to x, we intuitively expect B g>n (x) 
to be close to g(x), provided g does not fluctuate too much on intervals of width 1/y/n. To capture this 
fluctuation, define the modulus of continuity of g at scale d as 

u s (g) := sup \g(x)-g(y)\. 

x,y:\x—y\<S 

This is a measure of the "smoothness" of g: the lower the value u)g(g), the more "smooth" g is and the 
smaller our approximation error should be. Now an easy argument shows that for every x € [0, 1] we have 

\B g ,n(x) - g(x)\ = 0(u 1/VH (g)), 

confirming the above intuition. 

It is possible to reduce the error of approximation much further. An improvement of Bernstein's result 
was shown by Jackson |10j . Using trigonometric ideas, he proved 
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Theorem 1 (Jackson) There exists a universal constant C , such that for every g 6 C[0, 1] and positive 
integer n, there is a degree-n polynomial p satisfying \p(x) — g{x)\ < Cu)i/ n (g) for all x € [0, 1]. 

This quality of approximation is based on the maximum fluctuation of g at a much smaller scale than 
Bernstein's (1/n instead of 1/y/n). Up to the constant factor, Jackson's Theorem is optimal for approxima- 
tion guarantees based on the modulus of continuity. Several different proofs of the theorem are known, see 
for instance [7] . 

In this paper we show how one can implement Bernstein's idea with a quantum algorithm, improving its 
error bound to the one in Jackson's Theorem. Our idea is quite simple: we replace Bernstein's "algorithm" , 
which basically counts the number of ones in a bitstring of n coin flips, with a quantum counting algorithm. 
It was shown by Brassard et al. [5] that quantum algorithms can perform (approximate) counting more 
efficiently than classical algorithms, and this yields an improvement over Bernstein's approach. 

Let us sketch our proof strategy in somewhat greater detail. To begin, we perform N — n 2 x-biased coin 
flips instead of n, yielding a string w £ {0, 1}^. Now we have |iu|/iV = x ± 0(l/y/~N) = x ± 0(l/n) with 
high probability, so |to|/iV is a much more precise estimator of x than the \w\/n of Bernstein's proof. We 
then run a quantum counting algorithm making n/2 "quantum queries" to w. This algorithm computes an 
estimator A of \w\/N, such that with high probability 

\A- \w\/N\ < 0{l/n). 

Accordingly, with high probability A also approximates x within error 0{l/n). Then intuitively the function 

Q 9 ,n(x) := E w , A [g(A)} 

should approximate g up to error roughly Wx/ n {g). The expectation in the above expression is over both the 
choice of w, and over the randomness generated by the measurements in the quantum counting algorithm 
that runs on w and outputs A. 

It is well known that the acceptance probability of an n/2-query quantum algorithm is an A-variatc 
multilinear polynomial (in the bits of w) of degree at most n (see [3]). Taking the expectation over w turns 
this into a univariate polynomial in x of degree at most n, since the expectation of a monomial Wi 1 ■ ■ ■ Wi d is 
exactly x d . Working out the details (Section [3]), Q g ,N is indeed a degree-n polynomial that approximates g 
within error bound 0{u>i/ n {g)). This reproves Jackson's theorem. 

As we explain in the next section, efficient quantum counting relies on a more basic procedure called 
quantum phase estimation [11 . In Section |4] we give a second proof of Jackson's Theorem that is directly 
based on phase estimation. This naturally yields a proof of the trigonometric version of Jackson's Theorem 
(Theorem [2] below), but in fact the two versions are equivalent (see |14|). A trigonometric polynomial of 
degree n can be defined in two equivalent ways: as a sum of the form p(x) = Y^k=- n a ke 2vtxk , or as 
a sum of the form p{x) = ao + Ylu—i (afc cos(27rfcx) + b^ sin(27rfcx)), where ak,cik,bk are scalars, possibly 
complex^ A trigonometric polynomial is real if it maps real numbers to real numbers, or equivalently, if 
the coefficients a^, are all real. Note that a trigonometric polynomial is a 1-periodic continuous function. 
The trigonometric polynomials are dense in the set of all such functions, and the trigonometric version of 
Jackson's Theorem gives a quantitative refinement of this fact: 

Theorem 2 (Jackson, trigonometric version) There exists a universal constant C, such that for every 
1-periodic function g €E C\M] and positive integer n, there is a degree-n real trigonometric polynomial p 
satisfying \p(x) — g(x)\ < Cu>i/ n (g) for all 

The best constant was determined by Korneichuk, who showed that every 1-periodic g £ C[R] can be 
approximated by a degree-n trigonometric polynomial with error e < 0Ji/2 n (h), which is essentially optimal^) 

1 The equivalence of the definitions follows from Euler's formula e l ® = cos £> + i sin £>. 

2 By [121 Theorem 6.2.2], every 27r-periodic h £ C[M] can be approximated by a degree-n trigonometric polynomial with error 
s < ui^^ n (h); we have restated this here for 1-periodic functions. The error bound is essentially optimal: for every n and a > 0, 
there is a 27r-periodic function h such that every degree-n polynomial differs from h by at least (1 — l/2n — otju)^ / n (h) (see 1 1 21 
Lemma 6.2.3]). 
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Though the ideas used in our two approaches to Jackson's Theorem are closely related, we feel both have 
merit. The one based on quantum counting (Section [3]) is a quantum generalization of Bernstein's proof, 
while the one based on phase estimation (Section^) is a more "direct" approach since phase estimation is the 
basis for quantum counting. The phase estimation approach is in fact closely related to Jackson's original 
proof, as we explain in Section [5] 

Finally, let us mention that this paper fits in a sequence of recent applications of quantum computational 
techniques establishing or casting new light on results that have nothing to do with quantum computing 
itself. We refer to [5] for a survey. 



2 Preliminaries: Phase estimation and quantum counting 

Here we sketch how phase estimation works and how it can be used to do approximate quantum counting. 
The presentation is based on Brassard et al. [5J; the phase estimation algorithm is due to Kitaev [TT] . 



2.1 Phase estimation 

Suppose we can apply a certain unitary U as often as we want, and we are given one of C/'s eigenvectors, 
\u), with unknown eigenvalue e for x G [0,1). We would like to learn x. Quantum phase estimation 
allows us to approximate x up to any desired precision, as described below. It will be convenient to define 
the following distance for approximations: d(x,x) := min c gz |c+a;— x\ G [0, 1/2], so 2ird(x,x) is the shortest 
distance along the unit circle from e 27 ™ x to e 27Ttx . 

We start with a 2-register quantum state, where the first register contains the uniform superposition over 
M basis states, and the second contains the eigenvector \u): 



M-l 

== E itf>®i*>- 



M-l 

M 

y=o 

Now, conditioned on the first register's value y, apply U to the second register y times, i.e., map \y) ® \u) i-> 
e 2nixy \y) ® \u). This gives 

M-l 
v y=o 

From now on we ignore the second register. Apply the inverse quantum Fourier transform over Xm to get 

M-l M-l M-l M-l 

<?™vJ— ^2 e -^ iyz ' M \z) = ]T a z \z), where a z = — ^ e ^iyd{z/M,,) _ 



v y=0 v z=0 z=0 y=0 

If we measure this, we get a random variable Z G {0, . . . , M — 1} with distribution 

2 f 1 if d(z/M, x) = 0, 

Pr[Z = z] = \a z \ =i si n{Md(z/ALx)^ ntliprwi<5P W 

{ m 2 sin(<i( z /M,a : )7r) 2 otnerwise, 

where we used the identities Eyio 1 r y = (1 - r M )/(l - r) and |1 - = 2| sin(</)/2) | . Note that the first 
case is just the limit of the second case as d(z/M,x) — > 0. The distribution Pr[Z = z] is peaked at values 
of z where d(z/M,x) is small, so we can use X :— Z/M as our estimate of x. The probability of outcome 
X = x falls off quadratically with its distance from x: if x ^ x then (using sin(</>) > 24>/tt for <f> G [0,tt/2]) 

Pr[X = ~x ] =Vr[Z = xM ] < I ^ ;w . (2) 

For convenience, we let 1/0= oo and consider Eq. ([2]) to hold vacuously when d(x, x) = 0. 
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2.2 Quantum counting 

Suppose we have an input w € {0, 1}^ whose Hamming weight \w\ we want to estimate, using a quantum 
query algorithm that can access w by means of the query operator O w , which maps \i) > (—l) Wi \i). Then 
we can perform approximate counting using phase estimation as follows. 

For simplicity assume N — 2 n . Define a unitary U = —H® n OQH® n O w , where Oo is the unitary that 
puts a '— ' in front of the all-0 state, and H is the Hadamard transform. This U is known as the Grover 
iterate, and is the crucial ingredient in the quantum search algorithm jSJ|B]. Let = — ^ j Y]j. Wi —i \t) and 

|5» ) = —j= | | ^2j- Wi= o I*) be the uniform superpositions over the 1-bits and the 0-bits of w, respectively. 
One can show |6| that U has the following two orthogonal eigenvectors, with corresponding eigenvalues: 

|tf +) = + i|¥ » with A+ = e™, 

|*_) = (|*i> - i|*o)) with A_ = e- 2W , 

where 9 = arcsin( v / |w|/7V) e [0,tt/2]. 

Note that \w\/N = sin(#) 2 . We want to estimate 9 by means of phase estimation on U. For this we would 
need an eigenvector of U with eigenvalue related to 9. We cannot easily construct one of the two eigenvectors 
1^+) and I*-). However, the uniform superposition 

i x 1 i -x / \w\ . . IN — \w\ . . 

v i— 1 

is a linear combination of \^+) and |^_). This \u) is independent of w and easy to construct. We can analyze 
this as if the second register contains a mixture of the two eigenvectors. Thus, doing phase estimation on 
U with starting vector \u), we will be estimating either 9 or —6. Since sin(6>) 2 = sin(— 9) 2 , we don't care 
whether we estimate 9 or —9 (assume the first one for simplicity). Phase estimation with \u) as starting vector 
produces a random variable Ze {0, . . . ,M — 1}, such that the distribution of Z/M is peaked around 9/n. 
Accordingly, we use 9 :— ttZ/M as our estimate of 9, and A := sin(9) 2 as our estimator of \w\/N — sin(6*) 2 . 
Note that the number of queries of this procedure is the number of applications of U, which is M— 1. Eq. ([2]) 
gives a tradeoff between the number of queries and the error in our approximation of 9, which translates 
into an error in our approximation of \w\/N. Brassard et al. [6] work out various points on this tradeoff in 
detail. 



3 Jackson's Theorem by quantum counting 

In this section we provide the details of the idea sketched in the Introduction, combining Bernstein's proba- 
bilistic approach with a quantum counting algorithm. We first present a direct approach that does not quite 
work; we then explain how a simple modification allows the proof to go through. 

We want to construct an approximating polynomial of degree n for a given continuous function j 6 C[0,1]. 
We start by setting N := n 2 and letting w € {0, 1}^ be a string obtained by flipping N coins, each with 
probability x of T'. Since its Hamming weight \w\ is binomially distributed with expectation xN and 
variance x(l — x)N, we have E[|i£ — |tw|/JV|] < ^/Var[|w|/iV] = yjx(l — x)/N = 0(l/n). We now want to 
estimate |iy|/AT using quantum counting. As explained in Section^ we define 9 := arcsin( y/\w\ /N) , so that 
sin(6*) 2 = \w\/N. Letting M > 1 be an integer to be fixed later, quantum counting with M — 1 queries 
produces a random variable Z £ {0, . . . , M — 1}, such that by Eq. @, 



~ 4M 2 d(z/M,9/Tr) 2 ' 
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where d(-,-) is as defined in Section |2~T1 Since Z/M is concentrated (with respect to the distance d(-, •)) 
around 9/tt, we can use 9 :— irZ/M as our estimate of 9, and A := sin(#) 2 as our estimate of \w\/N. We 
claim that the angles 9, 9 are likely to be close "modulo tt" : we have 



2 ^ 1 



E[d(0/7T, 0/tt)] = J] = ^] • 9/n) <—+ £ 



M ^ AM 2 d(z/M,9/n) 

|z-A/e/ir|>l 



,1/ 



The function /(t) = sin(i) 2 is 7r-periodic and its derivative satisfies |/'(i)| = | sin(2i)| < 1. Thus 



| sm(9) 2 - sin(6») 2 | < tt • d(9/ir, 9/ir), (4) 

and hence also 

E[|A- \w\/N\] = E[|sin((9) 2 -sin(6l) 2 |] < tt • E[d(9/ir, 9/ir)} = 0((\og M)/M). (5) 
Define a function p : [0, 1] — > M by 

p(x) := E WtA [g(A)} = ^E w [PrL4 = a \ w]] ■ g(a), (6) 

a 

where Pr[u>] = x 

M(l _a;)-tf-M depends on a;, and Pr[A = a | w] is the probability that quantum counting 
on input w yields estimate a for |n;|/iV. The following lemma from [3] implies that for any fixed value a, 
Pr[A = a | w] is a low-degree polynomial in the N variables of w: 

Lemma 3 (BBCMW) Consider a quantum algorithm that makes at most T queries to w € {0, 1}^ and 
outputs the result of a measurement on the final state. Then the probability Pr[A = a \ w] of any specific 
output a is an N-variate multilinear real polynomial in w of degree at most 2T. 

Each polynomial Pt[A — a \ w] is a linear combination of monomials in w± , . . . , wn, of degree at most 2T. 
A degiee-d monomial Wi 1 ■ ■ ■ Wi d has expectation E w [wi 1 ■ ■ ■ Wi d ] = Yij=i ^tul^i,-] = x<i > which is a polynomial 
in x of degree d. Hence each expression E„,[Pr[A = a \ w]] in Eq. (J6j) is a polynomial in x of degree at most 
2M — 2. Then the function p defined in Eq. ([6]) is itself also a polynomial in x of degree at most 2M — 2. 
Choosing M := n/2 + 1, p has degree at most n. 

Next we argue that p approximates g fairly well. First, recall the definition of ujs(g) from the Introduction; 
we have the elementary property ws+S'ig) < ws(g) + an d therefore uj c $(g) < |~c] • u)&(g). Now, for 

every x £ [0, 1] we bound 

\p(x)-g(x)\ < E WiA [\g(A) - g(x)\] 



< 
< 
< 
< 



E Wt A[u\A-x\(j9)] 

E w ^A[u\ A -\w\/N\(g) + UJ \x-\w\/N\(g)] 

E W , A [\\A - \w\/N\ • (n/ logn)]] u logn/n (g) + E w [\\x - \w\/N\ ■ n\]u 1/n (g) 
O(ui ogn / n (g) + u x/n (g)) 



< O{uj logn/n (g)), (7) 

where in the penultimate step we used that E[|A— |u>|/-/V|] — 0((\ogn)/n) and E[|x— |io|/JV|] = 0(l/n). The 
error bound we derived is already substantially better than Bernstein's bound of 0(^1/^(3)), but worse 
than Jackson's optimal error bound 0(w\/ n (g)) by a logarithmic factor. To get rid of this factor, we want 
an estimate of |u;|/JV with expected error 0(1/ n) instead of O ( (log n)/n). 
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Taking an algorithmic perspective, it is easy to see how to get a more sharply concentrated estimate. 
We run the quantum counting procedure three times, yielding estimates Z\, Zi, Z3 for 9M/ir. These yield 
estimates 9i := ttZi/M for 9, for i — 1,2,3. We then get estimates A\,A^, A3 for \w\/N, by taking Ai :— 
sin(&i) 2 . We let A' := med(Ai, A2, A3) be defined as the median of these three estimates, and use A' as 
our improved estimate for |to|/JV. In order to keep the total number of queries at most n/2 (and hence the 
degree at most n), we will now spend M :— n/6 queries for each of the three runs of quantum counting, 
assuming for simplicity that 6 divides n. 

We now show that this approach yields a tighter estimate. First, it can be easily verified that for real 
variables a, b, c, t, 

I med(a, b, c) - t\ < med(|a - t\, \b - t\, \c - t\). (8) 

It follows that \A' — [w[/N[ is at most the median of \Ai — \w\/N\, over i — 1,2,3. Using Eq. (0| applied to 
each of the 9i, we obtain \Ai — \w\/N\ < 7r • d{9i/ r K, 9 jix). If we define a random variable d mec i as the median 
of the quantities d(9i/it, 9/ir), for i = 1, 2, 3, then we have 

\A' - \w\/N\ < ir ■ d med . (9) 

We now bound E[d me d]- Fix an integer k > 1. For d m ed & [k/M, (k + 1)/M) to occur, it is necessary that 
some Zi satisfies d(Zi/M,9/n) G [k/M, (k + l)/M), and some Zj with j ^ i satisfies d(Zj /M,9 /%) > k/M. 
There are at most two possible values z € {0, 1, . . . , M — 1} satisfying d(z/M, 9/ir) g [k/M, (k + 1)/M), and 
there are just 6 possible pairs i,j. Thus, we have 

Pr[d med e [k/M,(k+l)/M)\ < 
6 • Pr [d{Z 1 /M, 9/ir) E [k/M, (k + 1) /M)] ■ ^ Pr[Z 2 = z'\ 

z : 

d(z' /M,Q/ir)>k/M 



< 



< o 

It follows that 

E[d med ] <^ + E^7 i -0fi)=^-0(l + E4]=0 (1/M) = 0(l/n), 



M ^ M \ k 3 J M \ ^ k 2 

k>l v 7 \ k>l 

and by Eq. © we get E[\A' - \w\/N\] = O (1/n). 

We redefine the polynomial p as p(x) := K w ^ A >[g(A')]. By our revised setting M = n/6, p(x) is a 
polynomial of degree < n. Following the steps of Eq. ([7]) with appropriate changes, we have 

\p(x) - g(x)\ < E WjA ,[u\ A ,_\ w \ /N \(g) + u\ x -\ w \/ N \(g)] 

< E WtA , [\\A' - \w[/N\ ■ n\]u 1/n (g) + 0(u 1/n (g)) 
<0(u 1/n (g)). 

This proves Theorem [1] 



(i 



4 Jackson's Theorem by phase estimation 



As explained in Section [2] quantum counting is based on quantum phase estimation. In this section we 
describe an alternative way to prove (the trigonometric version of) Jackson's theorem. We will no longer 
estimate the weight of a string of x-biased coin flips, but instead apply quantum phase estimation directly 
to a unitary which "encodes" x. 

Suppose we apply phase estimation to the lxl unitary U — [e 27Vlx ] to estimate x. Since U is 1-dimensional, 
every vector \u) is an eigenvector with eigenvalue e 2 ™. As explained in Section [2] using up to M — 1 
applications of U, phase estimation produces a Z € {0, . . . , M — 1}, distributed as in Eq. ([1]), so that Z/M is 
concentrated around x (with respect to the distance d(-, ■)). Let M := n/3 + 1, assuming for simplicity that 
3 divides n. Suppose we apply phase estimation three times, getting outcomes Z\, Z2, Z3 £ {0, ...,M— 1}. 
Define Y as the median of g(Zi/M) over i — 1,2, 3; we use Y as an estimate of g(x). Note how we now take 
medians after applying g, in contrast to our previous approach]^ 

In order to approximate the continuous 1-periodic function g € C[R], consider the quantum algorithm 
that produces an estimate Y as above, using the unitary U — [e 2mx ]. We define p(x) :— E[Y]. Referring back 
to our description of phase estimation, we see that the amplitudes of the final state of our quantum algorithm 
are linear combinations of e 27rMfc for k 6 {0, . . . , 3(M — 1)}. Hence the final measurement probabilities are 
trigonometric polynomials in x of degree 3(M — 1) = Accordingly, p is such a polynomial as well. 

Since the distribution of each Zi is concentrated around x (with respect to d(-, •)) and g is continuous 
and 1-periodic, we expect p{x) to be close to g{x). We make this precise next. For each i, the definition of 
u)s(g) and the fact that g is 1-periodic implies that 

\g(Zi/M) - g(x)\ < \d(Zi/M, x) ■ n] Lo 1/n (g). (10) 

Eq. © implies that \Y — g{x)\ is at most the median of \g(Zi/M) — g(x)\, over i — 1,2,3. By analogy 
with the previous section, define the random variable d m ed as the median of d(Zi/M,x), i = 1,2,3. From 
Eq. ([10)) . we obtain 

\Y - g(x)\ < \d med ■ n] u\/ n (g). (11) 
Reasoning identical to that of the previous section yields E[d me d] = 0(1/ M) = 0(1/ 'n), and therefore 

\p(x)-g(x)\<E[\Y-g(x)\] 

< E[\d med , ■ n]]cj 1 / n (g) 
= 0(uj 1/n (g)). 

This proves Theorem [51 the trigonometric version of Jackson's Theorem. 

5 Relation to classical proofs 

Our approach to Jackson's Theorem in both sections bears strong similarities with classical proofs. The 
first approach, using quantum counting, was modelled on the probabilistic interpretation of Bernstein's 
polynomial approximation. The second approach, using phase estimation, also has a very close relation to 
a classical technique in approximation theory, the method of convolution with an approximation kernel j!5) . 
This method was employed by Jackson in his original proof |10) . We explain this method and its relation to 
our proof next. 

Suppose K (t) £ C(M.), the "approximation kernel," is a nonnegative, 1-periodic function concentrated 
around zero (mod 1), and such that J Q K(t)dt = 1. Then for any x € K, the function K x (t) :— K(t — x) is 

3 We could have taken medians after applying g in Section [3] as well, but we wanted to keep a conceptual focus on obtaining 
a sharp estimate of the quantity \w\/N (in analogy with Bernstein's proof). In the phase estimation approach, sharpening our 
estimate of x would be slightly more involved, and taking medians after applying g yields a more streamlined proof. 

4 While the amplitudes are linear combinations of e 2 ^ txk for k £ {0, . . . , 3(M — 1)}, the probabilities are sums of amplitudes 
times their conjugates, and hence are linear combinations of e 27 " xk with k £ {— 3(Af — 1), ... , 3(Af — 1)}. 
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concentrated around x (mod 1). This suggests that, for a 1-periodic function h(t) € C(K), the convolution 
of h with K, i.e., 

(h*K)(x):= h(s)K(s-x)ds, 
Jo 

should give a good approximation to h(x). As for degree considerations, a key fact is that, if K is a degree-n 
trigonometric polynomial, then so is (h * K) (see, e.g., Chap. 3]). Finally, note that the expression 
defining (h * K){x) is exactly E[/i(x)], when x £ [0, 1) is an approximation to x drawn according to the 
density function K x . Thus the convolution approach resembles the phase-estimation approach (with a single 
application of phase estimation), except that an estimator with a continuous distribution is used0 

In fact, even the distribution of the estimates we derive from phase estimation is intimately related to 
the approximation kernels used in Jackson's original proof [TUl [13j [7]. A classical approximation kernel is 
the Fejer kernel F n : R — > K (n > is an integer parameter), given by 

1 / sin(7rnt) x 2 



Fn(t) = ~ . 

n \ sm(7rr) 

with F n (0) = n for continuity. Note that F n as defined here is 1-periodic (it is usually defined in a 2-7r-periodic 
form). Also, F n can be re-expressed as a trigonometric polynomial of degree n — 1. Now we compare this to 
our analysis of phase estimation in Section^ Consulting Eq. ([1]), if the unknown eigenvalue e 27Ttx (x £ [0, 1)) 
satisfies Mx ^ Z, then the estimate y = Z/M to x produced by phase estimation (z £ {0, . . . , M — 1}) is 
distributed as 

We used the properties of the sine function and the definition of d(-, •) to get this equivalent form from 
Eq. ([T|). Hence y is distributed as a discrctized, rcnormalizcd Fejer kernel re-centered at x. (If Mx £ Z, then 
Pi[y = x] = 1.) 

Recall that our proof required a sharper estimate than that given by plain phase estimation — we had to 
apply phase estimation three times. Similarly, the convolution g * F n of g with the Fejer kernel fails to give 
a sufficiently close approximation to g to prove Jackson's Theorem. A sharper approximation is provided by 
the so-called Jackson kernel, obtained by squaring and renormalizing the Fejer kernel: 

J n (t) = cFl{t), (12) 

where c > is chosen so that J n (t)dt = 1. Jackson showed that \(g * J n )(x) — g[x)\ = 0(u>i/ n (g)) for 
all x £ [0, 1]. This (g * J n ) is a trigonometric polynomial of degree 2(n — 1), which can be reduced to 
degree < n by using </[n/2j m place of J n . Note how squaring and renormalizing the Fejer kernel has the 
effect of sharpening its concentration; this is somewhat analogous to the sharpening we achieve by taking 
medians-of-three in our quantum algorithms aboveH 
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made formal. 

6 We note that, if {p(z)}o< z <M -i ls a distribution describing the outcome of a specific quantum measurement on a state \<j>), 
and if q(z) = cp(z) 2 where c > is a normalizing factor, then the distribution q(z) also has a somewhat natural interpretation in 
quantum terms. Namely, suppose that we perform the measurement twice, on two identical (unentangled) copies of \<p), yielding 
outcomes z\,Z2, and condition on the event that Z\ = Zq,. Then q(z) describes the probability that Z\ = Z2 = z conditioned on 
Zl = Z2- This can also be phrased in the language of quantum postselection pQ, as we are postselecting upon the event zi = Z2- 
The renormalizing constant c in Eq. i l 1 21 ) is the reciprocal of the probability of this event. 
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